Superpoint Transformer for 3D Scene Instance Segmentation
نویسندگان
چکیده
Most existing methods realize 3D instance segmentation by extending those models used for object detection or semantic segmentation. However, these non-straightforward suffer from two drawbacks: 1) Imprecise bounding boxes unsatisfactory predictions limit the performance of overall framework. 2) Existing method requires a time-consuming intermediate step aggregation. To address issues, this paper proposes novel end-to-end based on Superpoint Transformer, named as SPFormer. It groups potential features point clouds into superpoints, and directly predicts instances through query vectors without relying results The key in framework is decoder with transformers that can capture information superpoint cross-attention mechanism generate masks instances. Through bipartite matching masks, SPFormer implement network training aggregation step, which accelerates network. Extensive experiments ScanNetv2 S3DIS benchmarks verify our concise yet efficient. Notably, exceeds compared state-of-the-art 4.3% hidden test set terms mAP keeps fast inference speed (247ms per frame) simultaneously. Code available at https://github.com/sunjiahao1999/SPFormer.
منابع مشابه
PixelLink: Detecting Scene Text via Instance Segmentation
Most state-of-the-art scene text detection algorithms are deep learning based methods that depend on bounding box regression and perform at least two kinds of predictions: text/nontext classification and location regression. Regression plays a key role in the acquisition of bounding boxes in these methods, but it is not indispensable because text/non-text prediction can also be considered as a ...
متن کاملClustering algorithms used in 3D scene segmentation
In this paper, we implement and compare three different clustering algorithms for the purpose of 3D image segmentation. Specifically, the K-means, Mean Shift, and Hierarchical methods are studied, and their performance is compared using cluster validity methods. Performance was analyzed in two ways, first by comparing independent results from each, and second, by comparing results where Hierarc...
متن کاملActive 3D Scene Segmentation Using Kinect
This thesis takes as a starting point a segmentation framework for visual scenes based on Markov Random Fields. The approach integrates several cues and considers the scene in 3D, improving the hypotheses in an iterative manner. In this thesis, we substitute the hardware previously used for a Kinect, a device capable of creating high quality depth maps of the environment. The main interest of t...
متن کاملA Stochastic Algorithm for 3D Scene Segmentation and Reconstruction
In this paper, we present a stochastic algorithm by effective Markov chain Monte Carlo (MCMC) for segmenting and reconstructing 3D scenes. The objective is to segment a range image and its associated reflectance map into a number of surfaces which fit to various 3D surface models and have homogeneous reflectance (material) properties. In comparison to previous work on range image segmentation, ...
متن کاملMultiple Rectangle Model for Vuildings Segmentation and 3D Scene Reconstruction
This paper introduces a method for automatic extraction of buildings in aerial images. We first present a method based on rectangular buildings, which are the most common constructions. After a rough segmentation, we estimate a criterion of similarity of each region with the best matching rectangle. Fur buildings of complex shapes, we introduce an iterative wag to divide a region in order to op...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i2.25335